Perceptual and computational separation of simultaneous vowels: cues arising from low-frequency beating.

نویسندگان

  • J F Culling
  • C J Darwin
چکیده

Identification of simultaneous speech sounds, such as pairs of steady-state vowels (double vowels), is more accurate when there is a difference in fundamental frequency (F0). Accuracy of identification for double vowels increases with increasing F0 difference (delta F0) asymptoting above 1 semitone. The experiment described here attempts to distinguish two mechanisms underlying this effect: first, perceptual separation by grouping together harmonic components of a common F0; and, second, exploitation of the fluctuations in the spectral envelope of the composite stimulus that result from beating between unresolved components. The beating is mainly caused by interactions between corresponding harmonics of the two vowels with a small delta F0. Identification accuracy for normal, harmonically excited double vowels was compared with that for double vowels composed from the same components, but whose constituent vowels were excited by a mixture of the two harmonic series. These double vowels were designed to produce similar beating patterns to the normal double vowels. Both harmonically and inharmonically excited constituents improved identification with increasing delta F0, but the increase was larger for harmonically excited vowels. A computational model based upon psychophysical measurements of auditory frequency and temporal resolution correctly predicted an increase in accuracy of identification with increasing delta F0 which was attributable to beating. The results are interpreted in terms of a spectral change cue in the identification of double vowels with delta F0's which complements grouping by F0, and which plays a dominant role for delta F0's smaller than 1 semitone.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modelling the perception of simultaneous semi-vowels

A model that is able to predict human performance in a simultaneous glide recognition task is described. The model combines a primitive, F0 guided, segregation stage and a schema driven stage with a heuristic that models whether listeners perceive a single or two simultaneous sounds. Introduction Previous studies [1,2,3] suggest that human listeners use simple cues, such as signal harmonicity, ...

متن کامل

Production of English Lexical Stress by Persian EFL Learners

This study examines the phonetic properties of lexical stress in English produced by Persian speakers learning English as a foreign language. The four most reliable phonetic correlates of English lexical stress, namely fundamental frequency, duration, intensity, and vowel quality were measured across Persian speakers’ production of the stressed and unstressed syllables of five English disyllabi...

متن کامل

The perceptual segregation of simultaneous auditory signals: pulse train segregation and vowel segregation.

In the experiments reported here, we attempted to find out more about how the auditory system is able to separate two simultaneous harmonic sounds. Previous research (Halikia & Bregman, 1984a, 1984b; Scheffers, 1983a) had indicated that a difference in fundamental frequency (F0) between two simultaneous vowel sounds improves their separate identification. In the present experiments, we looked a...

متن کامل

Acoustic Analysis of Persian EFL Learners' Pronunciation of English Vowels

This paper reports the results of an experimental study on non-native production of English vowels. Two groups of Persian EFL learners varying in language proficiency were tested on their ability to produce the nine plain vowels of American English. Vowel production accuracy was assessed by means of acoustic measurements. Ladefoged and Maddison’s (1996) F1 F2 measurements for American English v...

متن کامل

بررسی ساختار سازه‌ای واکه‌های زبان فارسی در بزرگ‌سالان دوزبانه آذری فارسی

Objective: Vowels are the center of syllables while formant structures are one of the most important acoustic characteristics of speech sounds that help in their articulatory and perceptual aspects. Formants represent the shape and size of the vocal tract. There exist trivial differences between the vocal tracts of different people due to which the formant structures of a vowel in one person ar...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • The Journal of the Acoustical Society of America

دوره 95 3  شماره 

صفحات  -

تاریخ انتشار 1994